21 ![On Structural Properties of MDPs that Bound Loss due to Shallow Planning. Nan Jiang, Satinder Singh, Ambuj Tewari; Computer Science and Engineering, University of Michigan](https://www.pdfsearch.io/img/7d5a2ac277f1aeadbf466bfb079db664.jpg) | Source URL: dept.stat.lsa.umich.edu | Language: English | Date: 2016-04-20 13:16:34

22 ![Exploiting Causality for Selective Belief Filtering in Dynamic Bayesian Networks. Journal of Artificial Intelligence Research, –1178. Submitted 12/15](https://www.pdfsearch.io/img/6ee8160cb537e29df4246073e75eb96a.jpg) | Source URL: jair.org | Language: English | Date: 2016-04-28 15:06:13

23 ![Sequential Decision Making in Repeated Coalition Formation under Uncertainty. Georgios Chalkiadakis, Craig Boutilier](https://www.pdfsearch.io/img/5e7898671eb170c034d678a2714759d1.jpg) | Source URL: www.intelligence.tuc.gr | Language: English | Date: 2008-02-08 15:14:59

24 ![Bias in Natural Actor-Critic Algorithms. Philip S. Thomas; Department of Computer Science, University of Massachusetts, Amherst, MA, USA](https://www.pdfsearch.io/img/e0e9cdbef63b189f8af42491fc651cab.jpg) | Source URL: psthomas.com | Language: English | Date: 2012-10-01 18:27:53

25 ![Verification of Markov Decision Processes using Learning Algorithms. Tomáš Brázdil, Krishnendu Chatterjee, Martin Chmelík, Vojtěch Forejt, Jan Křetínský, Marta Kwiatkowska, David Parker](https://www.pdfsearch.io/img/2b20ec2bc96e5ab2fee80fc6fe4d15c3.jpg) | Source URL: www.hieratic.eu | Language: English

26 ![Online Development of Assistive Robot Behaviors for Collaborative Manipulation and Human-Robot Teamwork. Bradley Hayes and Brian Scassellati; Dept. of Computer Science, Yale University](https://www.pdfsearch.io/img/310379b6ba06864d7063980250084146.jpg) | Source URL: bradhayes.info | Language: English | Date: 2016-07-11 15:51:46

27 ![Language Understanding for Text-based Games using Deep Reinforcement Learning. Karthik Narasimhan; CSAIL, MIT](https://www.pdfsearch.io/img/cf16415826f1c3641d9965c962e61a19.jpg) | Source URL: arxiv.org | Language: English | Date: 2015-09-14 21:25:04

28 ![Coordination in Multiagent Reinforcement Learning: A Bayesian Approach. Georgios Chalkiadakis, Craig Boutilier](https://www.pdfsearch.io/img/d7012116d1aecce5d10cec4f4b808f71.jpg) | Source URL: www.intelligence.tuc.gr | Language: English | Date: 2009-03-02 16:24:03

29 ![Sutton, Richard PIN](https://www.pdfsearch.io/img/347f57947a968e2c8b4faf8a06200ada.jpg) | Source URL: webdocs.cs.ualberta.ca | Language: English | Date: 2013-10-18 16:05:54

30 ![Learning from Demonstrations: Is It Worth Estimating a Reward Function? Bilal Piot, Matthieu Geist, Olivier Pietquin; Supélec, IMS-MaLIS Research group, France](https://www.pdfsearch.io/img/f373853f2274969808bc2bc182adfbf2.jpg) | Source URL: www.ilhaire.eu | Language: English | Date: 2013-10-03 05:33:46
